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Abstract 



Top-quark physics is one of the main fields of investigation at the Tevatron 
accelerator and, ultimately, at the LHC We perform a phenomenological 
analysis of ti events at hadron colliders, with a focus on observables relying 
on bottom-quark fragmentation in top-quark decay. In particular, we 
investigate the i?-lepton invariant-mass distribution in the dilepton channel 
and give an estimate of the contribution of bottom fragmentation to the 
Monte Carlo uncertainty on the top-quark mass reconstruction. 



1 Introduction 



Bottom-quark fragmentation in top decay (t — > bW) is one of the main sources of 
uncertainty in the measurements of the top-quark properties, such as its mass. In fact, 
6-quark fragmentation enters in the uncertainty on the 6-energy scale, contributing to 
the Monte Carlo systematics on the top-mass reconstruction (see, e.g., the top quark 
analyses from CDF [IHS] and DO [I] at the Tevatron accelerator). 

Monte Carlo generators, such as the general-purpose HERWIG [5] and PYTHIA 
[6] codes, implementing hard-scattering processes, parton showers, hadronization and 
underlying event, are widely used to simulate ti events at hadron colliders. In particular, 
HERWIG and PYTHIA simulate the hadronization transition according to the cluster [7] 
and string [8] models^, respectively, containing a few parameters which need to be tuned 
to the data, e.g., from LEP or SLD experiments. 

In order to estimate the contribution of bottom fragmentation to the Monte Carlo 
systematic error on top-quark observables, one typically compares the results yielded by 
the two codes and varies the hadronization parameters describing the b — > B transition 
within suitable ranges. At the LHC, it is worthwhile mentioning the study [11] , where it 
was proposed that one could reconstruct the top-quark mass in the dilepton channel, by 
using the decays B — >■ J/ip and J/ip — > B being a b- flavoured hadron. The top 

mass was then fitted from the peak value of the mj/^i or invariant-mass spectra, 
I being a charged lepton in W decay W — > tv. This analysis estimates that, in the 
phase of luminosity 10 5 pb" 1 , after setting suitable cuts on transverse momenta and 
rapidities of final-state leptons, one can reconstruct the top mass with an error Am t ~ 
1 GeV. The contribution of 6-fragmentation to Am t is found to be about 600 MeV and 
is estimated by using the PYTHIA generator along with the Peterson fragmentation 
function^], varying the e parameter in the range (5.0 ±0.5) x 10~ 3 . At the Tevatron, the 
recent CDF analysis [3] identifies jets containing a candidate muon from semileptonic 
5-decays (so-called 'soft muon 6-tagging') and measures the top mass by using the 
invariant mass m^, with £ still coming from VT-boson decay. The overall Monte Carlo 
uncertainty, due to the modelling of ti production and decay in HERWIG and PYTHIA, 
including ^-fragmentation as welg was estimated to be Am t ~ 2.1 GeV [3]. 

From the point of view of Monte Carlo generators, however, the default parametriza- 

1 As an option, PYTHIA allows one to interface its showers to fragmentation functions, such as the 
Bowler [9] or Peterson [10] models. 

2 The Peterson fragmentation function reads: D(x,e) = A/[x(l — l/x — e/(l — x))], where x is the 
hadron energy fraction, A a normalization constant and e a parameter to be tuned to the experimental 
data. 

3 The uncertainties due to the treatment of initial- and final-state radiation were, however, calculated 
separately from the Monte Carlo systematic error. 
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tions of both HERWIG and PYTHIA are unable to fit LEP and SLD data on 5-hadron 
production at the Z° pole [12]. In Ref. [12] the cluster and string models were tuned 
to such data: after the fits, PYTHIA managed to describe the 5-energy spectrum very 
well, whereas HERWIG was only marginally consistent. 

Following |12j . in this paper we wish to perform a phenomenological study of ti 
events at hadron colliders, taking particular care about observables relying on 6-quark 
fragmentation in top decay, and investigate possible discrepancies between HERWIG 
and PYTHIA, which may affect the Monte Carlo systematic error on the top-mass 
reconstruction. In particular, our investigation will be especially useful for the top mass 
extractions according to Refs. [3J,[TT] , as those methods strongly depend on the Monte 
Carlo simulation of the b — ^ B transition in top decay. 

In Section 2 we shall briefly review the results of Ref. [12] on fitting cluster and string 
models to LEP and SLD £>-production data. In Section 3 we shall present a few results 
for observables in ti events depending on the description of 6-quark fragmentation. In 
Section 4, as an example of application of our analysis, we shall try to estimate the 
uncertainty on the extraction of mt in the dilepton channel from a fit of the invariant- 
mass rriBe distribution, I being a lepton from W decay. In Section 5 we shall summarize 
the main results of our study and make some concluding remarks. 



Fitting hadronization models to LEP and SLD B- 
production data 



In this section we shall shortly summarize the main findings of Ref. [12] , where the cluster 
and string models, which simulate hadronization in HERWIG and PYTHIA, were fitted 
to LEP and SLD data on the S-hadron spectrum. Ref. [12J considered data on b- 
flavoured hadron production from the LEP experiments ALEPH [13] and OPAL [TI] . 
and from SLD [15]. In particular, the ALEPH sample was made of B- mesons, whereas 
OPAL and SLD also had a small fraction of 5-baryons, such as the A&. The B spectrum 
in e + e~ — > bb annihilation at the Z° pole was studied in terms of the quantity 

x B = o — • l 1 ) 

In Eq. ([1]), p B and pz are the B and Z° four-momenta, respectively. In Z° rest frame, 
xb = ZEb/itiz, the normalized 5-energy fraction. As for PYTHIA, it was chosen the 
scenario with parton showers ordered in virtuality, with an option to reject non-angular- 
ordered emissions. Angular ordering is correctly satisfied by the HERWIG cascades 
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HERWIG 


PYTHIA 


CLSMR(l) = 0.4 (0.0) 




CLSMR(2) = 0.3 (0.0) 


PARJ(41) = 0.85 (0.30) 


DECWT = 0.7 (1.0) 


PARJ(42) = 1.03 (0.58) 


CLPOW = 2.1 (2.0) 


PARJ(46) = 0.85 (1.00) 


PSPLT(2) = 0.33 (1.00) 




X 2 /dof = 222.4/61 (739.4/61) 


X 2 /dof = 45.7/61 (467.9/61) 



Table 1: Best-fit hadronization parameters in HERWIG and PYTHIA, after comparing 
with the 5-hadron energy spectrum measured at OPAL, ALEPH and SLD, along with 
the x 2 P er degree of freedom. In brackets, we quote the default values of such parameters. 

The default parametrizations of HERWIG and PYTHIA were unable to acceptably 
reproduce such data, yielding x 2 /dof = 793.4/61 and 467.9/61, respectively. In p2], the 
two event generators were therefore fitted to the xb spectra: the choice of the authors 
was to tune only parameters associated with hadronization and leave unchanged the ones 
related to hard scattering and parton showers. As pointed out in |12] , whenever one fits 
just one measured quantity, such as xb, the risk is that one may spoil the comparison 
with other observables, e.g., light-flavour fragmentation. Therefore, the fits performed 
in [T2] are not an official tuning of HERWIG and PYTHIA, but just an attempt to 
understand whether the description of heavy-flavour fragmentation could be improved. 

Table [1] summarizes the results of such fits: the x 2 P er degree of freedom refers to all 
data points, as if they were coming from one single experiment. In HERWIG, one fitted 
CLSMR(l) and CLSMR(2), namely the Gaussian smearing of the hadron direction with 
respect to the parent quark, PSPLT(2), a parameter ruling the mass spectrum of b- 
flavoured cluster decays, CLPOW, controlling the yield and meson/baryon production, 
and DECWT, determining the decuplet/octet ratio. As for PYTHIA, the three fitted 
parameters, namely PARJ(41), PARJ(42) and PARJ(46), are the a, b and r quantities 
in the Lund/Bowler fragmentation function |6j[9]: 

^ ^Lr^ 1 - expt-Kv^), (2) 

rrib and my being the 6-quark mass and the 5-hadron transverse mass respectively. From 
Table [H we learn that, after the fit, PYTHIA reproduces pretty well the data, while 
HERWIG is only marginally consistent with the xb spectra, although its description of 

4 Ref. [12] used HERWIG 6.506 and PYTHIA 6.220. The latest FORTRAN versions do not actually 
present any new features which may change the conclusions of Ref. |12) . 
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Figure 1: Data from LEP experiments on the B-hadron spectrum in e + e~ annihilation, 
along with HERWIG and PYTHIA, according to their default versions and with the 
hadronization models tuned to such data. 

the data is much better with respect to the default parametrization. The comparison 
between data, default and tuned HERWIG and PYTHIA is presented in Fig. [TJ We 
note that PYTHIA, after the tuning, gives an excellent description of the data through- 
out all xs-range, while the HERWIG prediction, even after fitting the cluster model, is 
still below the data around the peak and above the data for middle values of xb- The 
problems exhibited by HERWIG when its predictions are compared withthe xb distri- 
butions, have been mostly fixed in the object-oriented version HERWIG++ [IE], which 
implements an improved cluster model. In fact, Ref. [19] showed that it is enough tuning 
the shower cutoff to obtain a rather good fit of the SLD data. Although the employ- 
ment of HERWIG++ and of the corresponding PYTHIA 8 code [20], written in C++, 
should be recommended, and not only for the better description of I?-hadronization, the 
FORTRAN versions of these generators are still widely used. In particular, FORTRAN 
HERWIG provides the MC@NLO code [21], which implements the hard-scattering pro- 
cess at next-to- leading order (NLO), with parton showers and hadronization. Moreover, 
the so-called matrix-element generators, such as, for example, the ALPGEN [22J or Mad- 
Graph [23] programs, often employed to study backgrounds to tt events, are interfaced 
to FORTRAN versions of HERWIG and PYTHIA for showers and hadronization. It 
is therefore still useful trying to improve the 6-fragmentation sector in the FORTRAN 
generators and compare their results for a few observables relevant to top-quark decay. 
The best fits presented in Table [1] will be the starting point for the phenomenological 
analysis which we shall carry out for ti events at hadron colliders. 

Before closing this section, we point out that heavy-quark energy distributions can 
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also be obtained by using resummed calculations, such as Ref. [21] for e + e~ annihi- 
lation, Ref. [251 EE] for top decays and Ref. [27J for H — >■ bb, H being the Standard 
Model Higgs boson. Such computations, based on the perturbative-fragmentation for- 
malism [28J, resum soft/collinear logarithms with an accuracy which is usually higher 
than parton-shower algorithms: therefore, comparisons with resummations, as done 
in [12], are useful to validate Monte Carlo generators and understand the role played by 
subleading logarithms. However, resummed computations are too inclusive to allow a 
complete investigation of final states. Also, they still need to be supplemented by phe- 
nomenological non-perturbative fragmentation functions, such as the models [9l[TC)> 29j . 
to be comparable with experimental data on hadron production. In the following, we 
shall not only investigate the 5-hadron energy fraction, but also observables for which 
resummed calculations are not currently available. Monte Carlo generators, giving an 
exclusive description of final states, are thus the best available tool to carry out our 
study. 



3 Top-quark decay observables at hadron colliders 

Given the HERWIG and PYTHIA best fits presented in the above Section, and relying 
on the universality of the hadronization transition, we wish to make predictions for top- 
decay observables depending on 6-quark fragmentation, taking particular care about 
quantities which might be useful for Tevatron or LHC phenomenology. 



3.1 5-hadron energy fraction in top decay 

Let us consider top quark decay: 

t(p t )^b(p b )W(p w )X(p x ), (3) 

where X stands for extra parton radiation, and the subsequent transition b(pb) — > B(ps)- 
A straightforward extension of the xb variable in Eq. (pQ) is the following quantity: 

T = I 2 Pb ■ Pt U) 

B _. 2 / 2 i 2/2 2 V / 

1 — m^y/mf + mymf m£ 

xb is a Lorentz-invariant variable which corresponds to the normalized I?-energy fraction 
in top rest frame. At LEP and SLD, since the e + e~ collision takes place at the Z° pole, 
the laboratory coincides with the Z° rest frame. On the contrary, at the Tevatron or 
LHC, the laboratory frame is not the top-quark rest frame, and therefore, in order to 
measure the xb quantity, one would need all four components of top and B momenta. 
Such a measurement is obviously not straightforward; however, it is still interesting 
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Figure 2: 5-hadron spectrum in top decay, for m t = 175 GeV, according to HERWIG 
and PYTHIA, using the default parametrizations and after fitting cluster and string 
models to LEP and SLD data. 



presenting the xb spectrum in top decay, in such a way to compare HERWIG and 
PYTHIA before and after the fits to e + e~ data and understand how much i B depends 
on the top quark mass. We point out that, as already observed in [I21I2S], neglecting 
the top width, which is a reasonable approximation as long as experimental analyses set 
cuts of order 10 GeV or higher on the energy of final-state jets [30], the xb spectrum is 
roughly independent of the production process. Therefore, in such an approximation, 
our results will be valid for both Tevatron and LHC: unless stated differently, the actual 
plots which will shall present are anyway obtained running HERWIG and PYTHIA in 
the LHC environment, i.e. pp collisions at y/s = 14 TeV. 

Fig. [2] exhibits the 5-energy distribution in top decay yielded by PYTHIA and 
HERWIG, for m t = 175 GeV, using default and tuned parametrizations. As in the e + e~ 
case, the shapes of the spectra are remarkably modified once we fit string and cluster 
models: after the tuning, the distributions are somewhat narrower and shifted towards 
higher values of xb- The comparison of the two tuned codes is similar to what observed 
in Fig. [TJ HERWIG yields a broader distribution and is above PYTHIA for very large 
and middle values of xb, whereas it is below PYTHIA around the peak and at small 
xb- In Fig. [3] we present the same spectra, but varying the top mass from 171 to 179 
GeV, and learn that xb exhibits negligible dependence on the top mass, independently 
of the hadronization model which one uses. This is an interesting result: if one were 
able to measure xb, it would be an ideal quantity to fit 6-fragmentation parameters, 
with almost no dependence on the top-quark mass. However, as said above, for the time 
being, xb in top decays is a difficult observable to measure. 
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Figure 3: xb spectrum in top decay for m t = 171 and 179 GeV, according to default 
and tuned HERWIG (a) and PYTHIA (b). 

3.2 5-lepton invariant-mass distribution in the 
dilepton channel 

In this subsection, we investigate the B-lepton invariant-mass {tubi) distribution in 
the dilepton channel, where B is a 5-flavoured hadron coming from top decay and i a 
charged lepton in W decay (W —> tv). In fact, such a quantity is closely related to 
invariant masses m^i and mj/^i, where the J ftp's and /i's come from B decays, used in 
Refs. to fit the top mass at Tevatron and LHC, respectively 

The met invariant mass is another boost-invariant observable, just relying on top 
decay and not depending on the top-production phase. Such a quantity was already 
studied in Refs. [3DE2] , in order to investigate the impact of matrix-element corrections 
to simulations of top decays in HERWIG [33]. Ref. [32] also checked that the tubi 
distribution is roughly the same at the Tevatron and at the LHC, thus confirming that 
it is indeed independent of the production mechanism, which is mainly qq — > ti at the 
Tevatron and gg — > ti at the LHC. 

As done in the previous subsection for the xb quantity, we first compare HERWIG 
and PYTHIA for a given value of m t and then we vary the top mass. Fig. H] presents 
the default and tuned ttibi spectra for m t = 175 GeV: in both codes, the fit to the e + e~ 
data has the effect to shift the distributions towards larger invariant- mass values S As 
for the comparison between HERWIG and PYTHIA, the shapes of the curves yielded 

5 The results in Figs. 2] and [5] look different from the spectra presented in [31 ,32 , which were obtained 
using an unofficial preliminary HERWIG version, wherein a few bugs were later found. The effect of 
matrix-element corrections to the HERWIG simulation of top decays found in j31][32] is nevertheless 
still confirmed, when using the latest versions, with the bugs fixed. 
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Figure 4: 5-lepton invariant-mass distribution, in top decay and in the dilepton channel, 
according to tuned and default HERWIG and PYTHIA, for m t = 175 GeV. 
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Figure 5: m Bi spectrum in top decay for m t = 171 and 179 GeV, according to default 
and tuned HERWIG (a) and PYTHIA (b). 



by the two generators exhibit visible differences: HERWIG is above PYTHIA around 
the peak and below at small msi- At large msi the discrepancy becomes very little, 
with HERWIG still giving a slightly higher differential cross section. 

Looking at Fig. El we learn that the behaviours of HERWIG and PYTHIA spectra 
with respect to m t are rather similar, in both default and tuned versions. Increasing 
m t shifts the tubi spectrum towards higher invariant masses, as one would expect on 
physical grounds. We can anticipate that in Section 4 we shall thoroughly study the 
above spectra: we shall compute the Mellin moments and discuss of a possible extraction 
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Figure 6: Transverse momentum distributions of B-hadrons in top decay at the LHC, 
according to tuned and default HERWIG and PYTHIA, for m t = 175 GeV. Pt,b is 
evaluated in the laboratory frame. 



of the top mass from a fit of the mean value (rriBt)- 



3.3 5-hadron transverse momentum spectrum 

In this subsection, we investigate the transverse momentum of the I?-hadron in top decay 
(Pt,b) m the laboratory frame. Clearly, such an observable is not Lorentz invariant and, 
unlike xb and mse, it does not depend only on the decay, but the production phase is 
essential in determining its spectrum. However, it is still useful to study such a quantity: 
a measurement of pt,b is more feasible than xb and, as it happens, e.g., for the W/Z 
transverse momentum in Drell-Yan processes, can be useful to study the experimental 
acceptance for top events [31] . In the Pt,b spectrum, the hadronization parameters will 
certainly play a role, although, as explained above, a number of other quantities, in 
particular initial-state radiation, are relevant. 

In Fig. [6] we present the comparison between HERWIG and PYTHIA, default and 
tuned, for m t = 175 GeV at the LHC; in Fig. Owe plot the pt,b distribution for m t = 171 
and 179 GeV. As far as this observable is concerned, the discrepancy between HERWIG 
and PYTHIA looks smaller than for xb and mse, with PYTHIA being slightly above 
HERWIG at small Pt,b and below for middle-large transverse momenta. The effect of 
the tuning to LEP and SLD data is similar for both generators: less events at small x b 
and at the peak, but a higher differential cross section for Pt,b > 60 GeV. To be more 
quantitative, the average value of pt,b for m t = 175 GeV reads (pt,b) — 50.66 and 50.59 
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GeV, for default HERWIG and PYTHIA respectively. After tuning cluster and string 
models, HERWIG yields (p T)B ) ~ 53.01 GeV, whereas PYTHIA {p TjB ) ~ 52.20 GeV. As 
for the top-mass dependence of this observable, a higher m t results in less events around 
the peak value, and more with a large transverse-momentum 6-flavoured hadron. 

Since we observed that Pt,b is not Lorentz-invariant and depends on the ti pro- 
duction mechanism, it is interesting investigating such a quantity even at the Tevatron 
accelerator, i.e. pp collisions at 1.96 GeV, where the production mechanism is mostly 
qq — > ti, whereas gluon-gluon fusion dominates at the LHC. In Fig. [8] we present the B 
transverse-momentum distribution at the Tevatron, for m t = 175 GeV, given by default 
and tuned HERWIG and PYTHIA. We learn that at Tevatron energies the effect of the 
fits is qualitatively similar to what found at the LHC: less events are simulated at small 
transverse momentum and about the peak, while there are more top-decay I?-hadrons 
at large Pt,b- The mean values are (pt,b) — 47.91 and 47.66 GeV, according to default 
HERWIG and PYTHIA, whereas, when using the parametrization in Table HJ HERWIG 
gives (pr <B ) ~ 50.23 and PYTHIA 49.02 GeV. Hence, at both Tevatron and LHC, the 
two codes are in better agreement when using the default parametrizations, while the 
discrepancy gets larger after the tuning. 

In Fig. IH] we use instead the two codes with the fragmentation parameters tuned to 
the LEP and SLD data: in Fig. [9] (a) we show the Pt,b spectrum for mt — 171 and 179 
GeV, while, for the sake of comparison, in Fig. [9] (b) we present HERWIG and PYTHIA 
predictions at the Tevatron and at the LHC, using m t = 175 GeV. The dependence of 
the pt,b spectrum on the top mass is like the one observed at LHC: a higher m t shifts 
the B transverse- momentum distribution towards larger Pt,b- As for the comparison 
Tevatron/LHC, it shows indeed that we are dealing with an observable depending on the 
ti production stage and on the boost from the laboratory frame to the top rest frame, 
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Pt,b 



Figure 8: As in Fig. El but at the Tevatron accelerator. 
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Figure 9: (a): Pt,b spectrum for m t = 171 and 179 GeV, according to tuned versions 
of HERWIG and PYTHIA at the Tevatron. (b): Comparison of LHC and Tevatron 
B-hadron transverse momentum spectra, given by tuned HERWIG and PYTHIA, for 
m t = 175 GeV. 

where top decay is performed. At the Tevatron, due to the lower available energy, most 
events are simulated for pt,b < 70 GeV; at the LHC, the large-pr,B tail is instead more 
relevant with respect to the Tevatron. 
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4 Extracting the top mass from the tubi spectrum 



As the .B-lepton invariant mass is a Lorentz-invariant quantity, depending only on top 
decay and visibly sensitive to the top mass (see Fig. EJ, we can think of using vtibi to fit 
m t . Also, after convoluting with the B — > J/ip or B — > fi spectra, one will obtain 
the mj/^i and distributions, employed in Refs. [HUH] to extract m t at the LHC and 
at the Tevatron, respectively. 

Ideally, if we had data on rrise, we may directly use them to validate the Monte 
Carlo tools and fit the cluster /string models. For the time being, we try to express the 
tube spectra in terms of the top mass by computing the first few Mellin moments in the 
range 171 GeV < m t < 179 GeV. After observing that the fits to LEP and SLD do have 
a strong impact on top-decay observables depending on 6-fragmentation, hereafter we 
shall stick to the best-fit parametrizations quoted in Table [TJ 



m t (GeV) 


(m Bi ) (GeV) 


(rn^) (GeV 2 ) 


(m% e ) (GeV 3 ) 


«e) (GeV 4 ) 


171 


78.39 


7.01 x 10 3 


6.82 x 10 5 


7.02 x 10 8 


173 


79.52 


7.22 x 10 3 


7.12 x 10 5 


7.43 x 10 8 


175 


80.82 


7.45 x 10 3 


7.46 x 10 5 


7.91 x 10 8 


177 


82.02 


7.67 x 10 3 


7.79 x 10 5 


8.37 x 10 8 


179 


83.21 


7.89 x 10 3 


8.13 x 10 5 


8.86 x 10 8 



Table 2: First four moments of the tubi spectrum in top decay, yielded by HERWIG, 
after tuning the cluster model to ALEPH, OPAL and SLD data, for 171 GeV < m t < 179 
GeV. 



mt (GeV) 


(m m ) (GeV) 


(ml,) (GeV 2 ) 


(mil) (GeV 3 ) 


(m m ) (GeV 4 ) 


171 


77.17 


6.85 x 10 3 


6.62 x 10 5 


6.81 x 10 8 


173 


78.37 


7.06 x 10 3 


6.94 x 10 5 


7.23 x 10 8 


175 


79.55 


7.27 x 10 3 


7.25 x 10 5 


7.67 x 10 8 


177 


80.70 


7.48 x 10 3 


7.56 x 10 5 


8.12 x 10 8 


179 


81.93 


7.71 x 10 3 


7.91 x 10 5 


8.61 x 10 8 



Table 3: As in Table El but using the PYTHIA event generator. 
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We present the first four moments yielded by HERWIG and PYTHIA in Tables [2] 
and El respectively 0. From the comparison, we learn that HERWIG sistematically 
yields moments which are larger than PYTHIA, as was already predictable looking at 
Fig. HI Furthermore, according to both codes, the moments of the m B i spectrum linearly 
increase with respect to the top mass. 

In order to give an estimate of the Monte Carlo uncertainty due to modelling 6-quark 
fragmentation, we perform a linear fit of the average value (rriBi) in terms of m t , by 
means of the least-square method 0. The linear fit works very well and the best fits are 
the following: 

(m Bl ) u ~ -25.31 GeV + 0.61 m t , 5 = 0.043 GeV, (5) 
(m Be ) P ~ -24.11 GeV + 0.59 m t , S = 0.022 GeV, (6) 

where 5 is the mean square deviation in the fit and the subscripts H and P refer to 
HERWIG and PYTHIA, respectively. The best-fit straight lines, as a function of m t , 
are plotted in Fig. [lOj we see that, for a given measurement of (mse), the extracted 
values of mt can be quite different according to whether one uses HERWIG or PYTHIA. 
In fact, as Tables |2] and [3] tell us that the typical difference between HERWIG and 
PYTHIA is {tubi} — 1.2 — 1.3 GeV, the corresponding uncertainty inferred on mt can 
be up to about Amt — 2 GeV, given the slopes of the straight lines in Fig. [TUJ Such a 

6 We remind that, since we plotted (l/a)(da/dm,Bi) everywhere, our distributions are normalized to 
unity. 

7 Of course, given the numbers in Tables [5] and G2 a linear fit will work even for the higher Mellin 
moments of the rriBt spectrum. 
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value of Am t is clearly quite large, and much above the 600 MeV quoted in [TT], thus 
showing that probably such an error, obtained varying the e parameter in the Peterson 
hadronization model, may have been underestimated. Our Am t is instead closer to the 
2.1 GeV determined in [3] as the Monte Carlo systematic error when measuring m t from 
the spectrum at CDF. 

Our analysis, however, assumes that one is indeed able to measure the full m B i 
spectrum, which is obviously quite ideal. A more realistic estimate of Am t due to the 
6-quark hadronization can be obtained if we discard the low- and high-m^ tails and 
restrict ourselves, e.g., to the range 50 GeV < m Bi < 120 GeV. In this range, we 
obtain the truncated moments of the m B £ spectrum presented in Tables H] and The 
discrepancy between HERWIG and PYTHIA is clearly much smaller after we cut the 
tails of the spectrum; the linear relation of the moments with respect to m t is nonetheless 
still preserved. 



m t (GeV) 


(m Be ) (GeV) 


{ml,) (GeV 2 ) 


{ml,) (GeV 3 ) 


{m" m ) (GeV 4 ) 


171 


84.64 


7.52 x 10 3 


6.97 x 10 5 


6.70 x 10 8 


173 


85.01 


7.59 x 10 3 


7.06 x 10 5 


6.81 x 10 8 


175 


85.43 


7.66 x 10 3 


7.17 x 10 5 


6.94 x 10 8 


177 


85.78 


7.72 x 10 3 


7.25 x 10 5 


7.04 x 10 8 


179 


86.09 


7.78 x 10 3 


7.32 x 10 5 


7.13 x 10 8 



Table 4: Truncated moments of the vtibi spectrum, according to HERWIG in the range 
50 GeV < m Bi < 120 GeV. 



m t (GeV) 


(m B t) (GeV) 


{ml,) (GeV 2 ) 


(mil) (GeV 3 ) 


{ml E ) (GeV 4 ) 


171 


84.42 


7.49 x 10 3 


6.93 x 10 5 


6.65 x 10 8 


173 


84.79 


7.55 x 10 3 


7.02 x 10 5 


6.77 x 10 8 


175 


85.13 


7.61 x 10 3 


7.10 x 10 5 


6.87 x 10 8 


177 


85.45 


7.67 x 10 3 


7.18 x 10 5 


6.97 x 10 8 


179 


85.77 


7.73 x 10 3 


7.26 x 10 5 


7.06 x 10 8 



Table 5: As in Table HI but using the PYTHIA event generator. 



14 



87 



86 

s 
6 

v 85 



84 



170 172 174 176 178 180 

Figure 11: Linear fits of (mBe), as a function of m t , using HERWIG and PYTHIA in 
the range 50 GeV < m B i < 120 GeV. 

As done when considering the full mse range, we try to express the average value 
(rriBt) in terms of mt, according to a straight line. The best linear fits read: 

(m Be ) H ~ 53.33 GeV + 0.18 m t ; 5 = 0.034 GeV, (7) 
(m Bi ) P ~ 55.83 GeV + 0.17 m t ; 5 = 0.020 GeV. (8) 

The corresponding straight lines are plotted in Fig. [TTJ Since the found discrepancy 
between HERWIG and PYTHIA is about A(m Be ) ~ 200 - 300 MeV, and given the 
slopes of the straight lines in Fig. [TTJ, the induced uncertainty on the top mass, thinking 
of extracting it by fitting the mean value (mBe), goes down to Amt — 1.5 GeV. It is 
nonetheless still a quite large value, well above the estimate of the bottom-fragmentation 
contribution to the Monte Carlo error given in [TT] . 

5 Conclusions 

We performed a phenomenological study of bottom quark fragmentation in top-quark 
decay, using HERWIG and PYTHIA, the two most popular general-purpose Monte 
Carlo event generators. We observed that the default parametrizations are unable to 
reproduce B-hadron production data from ALEPH, OPAL and SLD, and therefore we 
used an 'unofficial' fit of the hadronization cluster and string models, following [12], in 
such a way to improve the description of such data. 

We used this tuning to make predictions for a few observables in ti events, depending 
on modelling 6-fragmentation in top decay, and found that the fits to e + e~ data have a 
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remarkable impact even on top-decay observables. Moreover, HERWIG and PYTHIA 
results still exhibit visible discrepancies, which depend on the different quality of the fits 
to LEP and SLD data. We studied the £>-energy fraction in top decay, which turned out 
to be roughly independent of the top mass, the 5-lepton invariant mass uibi, exhibiting 
relevant dependence on m t , and the B transverse momentum in the laboratory frame, 
which can still be useful to determine the experimental acceptance for ti events, although 
it is not Lorentz-invariant and depends on the top-production mechanism as well. 

Among these quantities, we have taken particular care about mse, whose spectrum is 
also interesting for the purpose of the analyses [5|fTT] , where the top mass is reconstructed 
by using final states with leptons and J/ip or muons.. We calculated the Mellin moments 
of the mse distribution and parametrized the average value {rriBe) as a linear fit of the top 
mass. The found discrepancies between HERWIG and PYTHIA result in an uncertainty 
on the top mass, assuming that one can extract it from a fit of {mse), which can be up 
to Am t — 2 GeV. The Monte Carlo error due to modelling 6-fragmentation decreases 
down to Am t ~ 1.5 GeV, if we restrict our analysis to the region around the invariant- 
mass peak, namely 50 GeV < mm < 120 GeV. As such estimates are quite large, our 
study confirms that bottom fragmentation will play a crucial role in top-quark analyses 
at Tevatron and LHC and that having event generators reliably describing the b — > B 
transition will be fundamental. 

A possible extension of our work clearly consists in employing the object-oriented 
versions of HERWIG and PYTHIA, written in C++. In fact, the discrepancies here 
emphasized mainly depend on the fact that, even after the fits, FORTRAN HERWIG 
is only marginally consistent with 5-hadron data from LEP and SLD. Therefore, since 
the preliminary results presented in [19J, obtained by comparing an early version of 
HERWIG++ with the SLD 5-data, look encouraging, we believe that a lower Am t can 
eventually be obtained when using the C++ programs. However, the analysis carried 
out throughout this paper will still be valid as long as one uses MC@NLO or matrix- 
element generators interfaced to FORTRAN HERWIG and PYTHIA for showers and 
hadronization. 

We also stressed the fact that the fits carried out in Ref. [T2] and reviewed in Section 2 
just account for the 5-hadron data and may spoil the comparison with other observables, 
e.g., light-hadron data. In perspective, the advanced fitting code Professor [35] should 
be a very useful tool, as it is capable of improving the description of the xb spectrum, 
but without spoiling too much possible agreement with other data. The use of the 
Professor program is currently in progress. 

Furthermore, let us point out that, whereas in our study we aimed at predicting 
a few top-decay observables taking non-perturbative information from e + e~ annihila- 
tion, possible hadron-collider data, for example on the m,Bt distribution, should be very 
useful to validate tools such as HERWIG or PYTHIA. This way, one could directly fit 
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such spectra and tune the parameters of cluster and string models, without necessarily 
relying on the fits to the e + e~ data. For the time being, we still believe that it can be 
nevertheless very interesting reconsidering the studies [5|ITT]. as they strongly rely on the 
Monte Carlo treatment of bottom fragmentation in top decay, and understand whether 
the results on Am t quoted in [31 [11] should change if one used the tuned versions of 
cluster and string models, as we did throughout this paper. 
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